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Abstract. We study statistical properties of galaxy structures in several samples extracted from the 2dF Galaxy 
Redshift Survey. In particular, we measured conditional fluctuations by means of the scale-length method and 
determined their probability distribution. In this way we find that galaxy distribution in these samples is charac- 
terized by large amplitude fluctuations with a large spatial extension, whose size is only limited by the sample's 
boundaries. These fluctuations are quite typical and persistent in the sample's volumes, and they are detected in 
two independent regions in the northern and southern galactic caps. We discuss the relation of the scale-length 
method to several statistical quantities, such as counts of galaxies as a function of redshift and apparent mag- 
nitude. We conflrm previous results, which have determined by magnitude and redshift counts that there are 
fluctuations of about 30% between the southern and the northern galactic caps and we relate explicitly these 
counts to structures in redshift space. We show that the estimation of fluctuation amplitude normalized to the 
sample density is biased by systematic effects, which we discuss in detail. We consider the type of fluctuations 
predicted by standard cosmological models of structure formation in the linear regime and, to study nonlinear 
clustering, we analyze several samples of mock-galaxy catalogs generated from the distribution of dark matter in 
cosmological N-body simulations. In this way we conclude that the galaxy fluctuations present in these samples 
are too large in amplitude and too extended in space to be compatible with the predictions of the standard models 
of structure formation. 
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1. Introduction 



In one of his seminal papers de Vaucouleurd (197^ put 
into a historical perspective the problem of galaxy large- 
scale structures and the question about the scale where 
galaxy distribution turns to homogeneitjQ- He points out 
that observations have first found that galaxies are not 
randomly distributed, then that in the fifties the same 
property was assigned to cluster centers, and finally that 
at the end of the sixties the discovery of super-clusters 
has still enlarged the scale of structures in the universe, 
thus pushing the scale where the approach to homogeneity 
occurs to larger and larger scales. 

In the past twenty years many observations have 
been dedicated to th e study of the large-scale dis- 
tributions of galaxies (jHuchra et al.l . Il983t iFalco et al 



^ Note that de Vaucouleurs simply considered the scale where 
galaxies approaches a random distribution. A distribution can 
be homogeneous, i.e. with a well-defined mean density, without 
being random; i.e. there can be small amplitude fiuctuations 
with weak long-range correlations. 



19991: iGiovanein and Havnesl Il993l: Ida Costa et"all. ll98S 



Shectman et all . Il996t lYork et all l20o6[ iColless et al 
20011 ). Despite the fact that large-scale galaxy struc- 
tures, of about several hundred Mpc/h size □, have 



been observed (Ide Lapparent Huchra and Geller . 
Geller and HuchraT 19891 : Giovanelli and Hav: 



nes 



1986 



1993 



Gott et al.l . 120051 ) to be the typical feature of the distri- 
bution of visible matter in the local universe, the statis- 
tical analysis measuring their properties has identified a 
characteristic scale that has only slightly changed since 
its discovery forty years ago in angular catalogs. This 
scale. To, was measured to be the one at which fluctua- 
tions in the galaxy density field are about twice the value 
of the sample density and it was indeed determined to 
be ro ~ 5 Mpc/h in the Shane and Wirtanen angular 



catalog ( Totsuii and Kihara . 1969f). Subsequent measure - 



ments of this scale — 

( Tgsih : 



Davis et al 



see e.g~lDavis and Peebles! (Il983l) 



Park et all (|l994 ): iBenoist et al 



^ We use Ho 
constant. 



lOO/i km/sec/Mpc for the value of the Hubble 
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(Il996l) : iNorberg et all (|200ll . l2002bl) . IZehavi et all (|2002 . 
120041) — found a similar value, although in several sam- 
ples larger values of rg have been found (i.e., tq ~ 6 — 12 
Mpc/h). This variation wa s then ascribed to a luminosity 
depen d ent effect — see e.g. Davis et al. (Il988l): Park et al 



( 1994 : iBenoist et all (|l996l ): IZehavi et all (|2002^" 

Theoretical models of galaxj;^ formation, like the cold 
dark matter (CDM) one (see IPeacock. 1999) are able to 
predict the scale vq once it is given the amplitude and cor- 
relation properties of fluctuations of the initial conditions 
in the early universe. The normalization of the matter 
initial condition can be obtained by measuring the ampli- 
tude and correlation properties of the anisotropics of the 
cosmic microwave background radiation (CMBR). Then 
by calculating the evolution of small density fluctuations 
in the linear perturbation analysis of a self-gravitating 
fluid in an expanding universe, it is possible to predict 
the scale tq today. This turns out, in curr ent models such 
as th e CDM ones, to be tq 

200i). On scales r < vq models are unable to make pre- 



cise predictions of the shape of the correlation function 
because gravitational clustering in the nonlinear regime 
is difficult to be treated. Gravitational N-body simula- 
tions are then used to investigate structure formation in 
the nonlinear phase. In addition, given that models pre- 
dict, for r > To, a precise type of small amplitude fluc- 
tuations, it is possible to simply relate, by using the lin- 
ear perturbation analysis mentioned above, the proper- 
ties of fluctuations in the present matter density field to 
those in the initial conditions. In a certain range of scales 
greater than rg, small amplitude, fluctuations should have 
still positive correlations. Particularly, for ro < r < Tc, 
fluctuations have v ery small amplitude and weak posi- 
tive correlations (see Svlos Labini and Vasilvev , 20081 ) . On 
even larger scales r > Tc (where this is estimated from 
CMBR measurements to be Tc w 100 Mpc/h), all mod- 
els predict that the matter density field presents anti- 
correlations that tend to zero wit h a (negative) power- 
law behavior of the type — 



(|Gabrielli et all . 120021: 



Svlos Labini and Vasilvevl . l2008f ). This negative power-law 
tail corresponds in real-space to the linear dependence of 
the matter power-spectrum (PS) on the wave-number; i.e., 
P{k) ~ k. The former represents a behavior that can be 
interpreted as a consistency requirement for the properties 
of densi ty fluctuations in Fr iedmann-Robertson- Walker 



models (|Gabrielli et al.l . 120021 ). Because of the change in 
sign of the correlation function at Tc, this length-scale rep- 
resents the cut-off in the size of weak amplitude structures 
in standard models. Thus, in the regime where fluctua- 
tions are small and have weak positive correlations; i.e., 
for To < r < Tc, the present matter-density field reflects 
the imprint of the initial conditions. 

The fundamental test for current models of galaxy 
formation then concerns whether density fluctuations on 
large scales (i.e., r > 10 Mpc/h) have small ampli- 
tude or not. Another important question concerns the 
detection of anti-correlations on scale s of r > Tc ~ 
100 Mpc/h (|Svlos Labini and Vasilve"vl l2008f) . The pri- 



mary problem to be considered in this respect con- 
cerns the statistical methods used to measure the am- 
plitude of fluctuations and the range of correlations. 
There has been intense debate in the past decade 



concerning this crucial point dSylos Labini et al 
IWu et all. Il999l: iHogg et al.l. l2005t iGabrielh et al 



1998 



2005 



bar vshev and Teerikorpill2005H Svlos La bini et "311120081 ) 

Before one determines the amplitude of fluctuations in a 
given volume with respect to the sample density, one must 
have firstly tested that the former quantity is stable; i.e., 
that it does not depend on the sample size and/or it does 
not present large fluctuations in different samples contain- 
ing the same type of objects. Indeed, in case the distribu- 
tion presents structures and fluctuations on all scales in a 
given sample (i.e., it is inhomoge neous) the sample density 



is not a well-defined descriptor l) Svlos Labini et al.l 119981 . 
]2008) . In this situation all statistical quantities that are 
normalized to the sample density are affected by system- 
atic effects. For this reason, prior to the characterization 
of fluctuations with respect to the sample density, a fun- 
damental test c onsists in measuring conditional correla- 
tion properties (jCabrielli et al.l . l2005h . It has been found 
that conditional statistical quantities, such as the condi- 
tional number of points in spheres, indeed show scaling 
properties on small scales r < 20 Mpc/h, e.g., the for- 
mer grows as a function of di s tance more slowly than 
the volume (ISvlos Labini et all. Il998t [Hogg et al.l. 12005 



IVasi lvev et al.ll2006HSvlos Labini et al.ll2007l . l200a) . This 
result implies that unconditional quantities are affected by 
systematic finite-size effects and thus do not give a reliable 
and meaningful estimation of correlations and amplitude 
of fluctuations. In this situation the length scale rg can be 
an artifact of a statistical analysis, which assumes that the 
sample average is a meaningful estimation of the asymp- 
totic density; i.e., it assumes that the distribution is ho- 
mogeneous and that fluctuations have a small amplitude 
well inside the sample volume. 



While estimations of real-space correlation properties 
can be affected by finite-size effects, this is not the case 
when one counts galaxies as a function of redshift or 
apparent magnitude. In this case indeed one does not 
normalize statistical quantities to the sample average, 
and large fluctuations have been fo u nd bo th in redshift 
(|Kerscher et all. Il999t IChiaki et all 120031) and angular 
surveys (see Picardl 199lt FHth et al.l 12003 ). In partic- 
ular, in a CCD survey of bright galaxies within the north- 
ern and southern strips of the 2dF Galaxy Redshift Survey 
(2dFGRS) conclusive evidence is found of fluctuations of 
~ 30% in galaxy counts as a function of apparent mag- 
nitude (jBusswell et al.l . l2004) . Since in the angular region 
toward the southern galactic cap (SGC) a deficiency, with 
respect to the northern galactic cap (NGC) in the counts 
below magnitude ~ 17 (in the B filter) was found, persist- 
ing over the fuU area of the APM and APMBGC catalogs, 
this would be evidence that there is a large void with a 
radius of about 150 Mpc/h, implying that there is more ex- 
cess large-scale power than detected in the 2dFGRS corre- 
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lation functioijl (jNorberg et al.l . [200lll2002b ) or expected 
in the CDM models. It is indeed evident that, because of 
the difference in the counts' amphtude, and thus in the 
sample density, between the NGC and the SGC samples, 
any estimation of the sample density is not stable. Thus 
the problems for the normalization of fluctuations ampli- 
tude to the estimation of the sample density should be 
studied in great detail. 

In this paper we use the 2dFGRS to study fluctua- 
tions in galaxy distribution on large scales and to deter- 
mine their statistical properties in redshift and magnitude 
space. Our aim is to employ statistical descriptors that do 
not, implicitly or explicitly, make use of the normalization 
of fluctuations to the sample average, thus avoiding the 
a-priori assumption of homogeneity inside a given sam- 
ple. Thus we determine conditional statistical properties, 
ther eby expanding our previous findings in th i s sam e sur- 
vey (|Vasilvev et al.l l2006t ISvlos Labini et all . l2009t ). We 
find that the puzzle of the coexistence of difference in den- 
sities in the NGC and SGC volumes on large spatial scales 
with a relatively small typical length scale of a few Mpc, 
can be understood as due to finite size effects in the esti- 
mation of the correlation function. 

The paper is organized as follows. In Sect [2] we discuss 
the 2dFGRS data and the procedure used to construct the 
sub samples for the statistical analysis. The methods for 
characterizing homogeneous and heterogeneous distribu- 
tions are briefiy reviewed in Sect (31 In Sect[¥]we present 
the main results of the analysis and consider in detail 
the problems related to estimating fiuctuations' ampli- 
tude normalized to the sample density. The properties of 
fluctuations of the matter density fleld predicted by the- 
oretical models, in the linear regime, are briefly reviewed 
in Sectini Then, to study expected nonlinear fluctuations 
in standard theoretical models, we analyze the properties 
of mock-galaxy catalogs, generated from the dark matter 
density fields stemming from a cosmological N-body sim- 
ulation — the Millennium Run by .Spr ingcl et al. (2005). 
Finally in Sect [H] we discuss the results of the analysis of 
the 2dFGRS samples and outline our main conclusions. 

2. The 2dFGRS samples 

The 2dFGRS (jColless et all . l200l[) measured redshifts for 
more than 220, 000 galaxies in two strips in the SGC and 
in the NGC. The median redshift is z ~ 0.1. The appar- 
ent magnitude corrected for galactic extinction in the bj 
filter is fimited to 14.0 < bj < 19.45. The selection of 
the samples used in the analysis is described in detail in 
Vasilyev et al. I (|2006h . Here we briefly summarize the main 
points. 

— To avoid the effect of the irregular edges of the survey, 
we selected two rectangular regions: in the SGC there 
is a sfice of size 84° x 9° hmited by -33° <S < -24°, 



^ These statistics, as mentioned above, normalize the ampli- 
tude of fluctuations to the estimation of the sample density. 
From it the length scale ro ~ 6 — 8 Mpc/h is derived. 



VL sample 




Rmax 






N, 


SGC400 


100 


400 


-20.8 


-19.0 


29373 


NGC400 


100 


400 


-20.8 


-19.0 


23208 


SGC550 


150 


550 


-21.2 


-19.8 


26289 


NGC550 


150 


550 


-21.2 


-19.8 


18030 



Table 1. Main properties of the obtained VL samples. 
Rmin , Rmax are the chosen limits for the metric distance; 
Mmim Mmax are the corresponding limits in the absolute 
magnitude; Ng is the number of galaxies in the sample. 



-32° < a < 52°, while the NGC slice is smaller, i.e., 
60° X 6°, with limits -4° < 5 < 2°, 150° < a < 210° 
(coordinates are equatorial). The solid angles are SI — 
0.20,0.11 steradians for the SGC and the NGC slices. 
We selected galaxies in the redshift interval 0.01 < 
z < 0.3, with redshift quality parameter larger or 
equal to three, in ord er to get high quality redshifts 



( Hawkins et"an . l2003f) . 



We did not use a correction for the redshift- 
completeness mask and for the fiber collision effects. 
In fact, completeness varies mostly near the survey 
edges, which are excluded in our sample. We assumed 
that fiber collisions do not make a noticeable change 
in the small-scale correlation properties given that we 
set our lower cut-off to 0.5 Mpc/h, which is la rger than 
the 0.1 Mpc/h used bv lHawkins et all (|2003h . 
The metric distance is usually computed as in 
Zehavi et all (|2002h : 



R(z) 



c 



dy 



y- (fiM/y + ^A -y^) 



1/2 



(1) 



where we used the standard model parameters f2j\/ = 
0.3 and Ha = 0.7, and c is the light speed. 
The absolu t e ma gnitude was computed as in 
Zehavi et al"] (|2002h : 



M^bj-5- logio [R{z) ■ (1 + z)] - Kiz) - 25 , (2) 



where K{z) is the K-correction term ( Hogg et al.l . 
2002h 

To calculate the K-correction K(z) we used relations 
obtained b v iMadgwick et al and we applied 

them as in Vasilyev et al.l ( 2006f K 
The volume-limited (VL) samples were identified by 
two limits in the metric distance Rmin < R < Rmax 
and two corresponding limits in the absolute magni- 
tude Mmin and Mmax- In Table [T] we report the prop- 
erties of the VL samples. 



It is known that the completeness-mask for the sur- 
vey includes the effects not only of redshift incomplete- 
ness (which varies both with field and with apparent 
magnitude), but also of the variations in the faint lim- 
iting magnitude across the survey regions. In order to 
test the robustness of our results, without correcting di- 
rectly for the incompleteness using ad-hoc assumptions. 
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VL sample 


Rmin 


Rmax 








SGC400 


100 


400 


-20.6 


-19.2 


22872 


NGC400 


100 


400 


-20.6 


-19.2 


18407 


SGC550 


150 


550 


-21.1 


-20.0 


18890 


NGC550 


150 


550 


-21.1 


-20.0 


13495 



ditions when they hav e zero average density in the infinite 



Table 2. As Tabled] but for the case of more conservative 
cuts in apparent magnitudes; i.e. 14.5 < bj < 19.3, were 
used for selecting the galaxies 



we restricted the samp le to the apparent ma gnitude lim- 



its 14.5 < bj < 19.3 ([Hawkins et all . |2003[ ). We report 



in Table [2] the characteristics of the VL with these more 
conservative cuts. 

As a self-consistent test, we note below that the statis- 
tical analysis in the samples constructed with less con- 
servative cuts agree with previous determinations for 
what concerns the galaxy count s as a function of ap- 



2002a: Busswell et al 



paren t magnitude (jNorberg et al. 

the redshift distribution (iRatcliffe et al.1 



19981 : 



Busswell et al. . 20041). and the standard two -point corre- 



lation function (jNorberg etliD . l200ll . l2002bh 



3. Statistical methods 

In this section we review the main properties of stationary 
stochastic point processes. These include both the ones 
that have a strictly positive ensemble average density and 
those which have it equal to zero. This discussion clarifies 
what the useful statistical methods are in both cases for 
analyzing a fini te sample. A more ex haustive treatment 
can be found in iGabrielli et aL (2005). 

3.1. Volume average, ensemble average, and 
self-averaging property 

The problem of the statistical characterization of correla- 
tions and fluctuations of a stochastic distribution of points 
in a finite sample of volume V can be rephrased as the 
problem of measuring volume-averaged statistical quan- 
tities. The basic issue concerns whether or not these are 
meaningful descriptors, i.e., whether or not they give sta- 
ble statistical estimations of ensemble averaged quantities. 
In this respect one has to consider various problems that 
maybe clarified after the definition of the general proba- 
bilistic properties of stochastic point processes. 

First of all, we need to define the ensemble proper- 
ties. In general it is assumed that galaxy distribution is a 
stationary stochastic process, which means that it is sta- 
tistically, translationally, and rotationally invariant; i.e., 
it satisfies the condition of spatial statistical isotropy and 
homogeneity in order to avoid special points or direction^ 
Stationary stochastic distributions also satisfy these con- 



Because of the hypothesis of statistical stationarity, sta- 
tistical quantities generally depend only of the scalar distance 
between points. 



volume limit ( Gabrielli et al. . 20051 ) 



Due to the assumption of ergodicity (i.e., the ensemble 
average is equal to the infinite volume average), the ex- 
istence of a well-defined average density implies that, for 
a single realization of the mass distribution, the following 
limit is well-defined: 

/ n{r)dr = no ,Wxo . (3) 

'V{R,xo) 

= 4ttR^ /3 is the volume of the sphere of 



lim 

R^oo V{R; Xf)) 



where V{R,Xo) 
radius i?, centered on an arbitrary point Xq . The constant 
riQ is strictly positive for homogeneous (or uniforrr(f|) dis- 
tributions and zero for inhomogeneous ones (e.g. fractals). 
It is then clear that all statistical quantities that are nor- 
malized to riQ are meaningless when this is equal to zero 
in the ensemble average or in the infinite volume limit. 

Keeping these mathematical properties in mind we 
have to consider the situation occurring when in a finite 
sample, of size ~ V^^^, the distribution is not homoge- 
neous. In this case the estimator of the average mass den- 
sity gives a large relative error with respect to the ensem- 
ble value making it systematically biased. This implies 
that only statistical averages conditioned to the fact that 
the origin of coordinates is a point of the set are well- 
defined. 

Clearly a distribution can be inhomogeneous up to a 
length scale aJI and homogeneous for r > Xq. Then for 
r < Ao the distribution is characterized by large fluctua- 
tions, and the average density is not a well-defined quan- 
tity if it is estimated in samples of size V^^^ < Xq. Instead 
density fluctuations are small for r > Xq and the sample 
density converges to the asymptotic (or ensemble) value 
when the sample volume is such that V^^^ > Xq. The 
precise behavior of this convergence is determined by the 
(weak) two-point correlation properties of the distribu- 
tions, and the c onvergence will be slow er when correlations 



are long-range (|Gabriem et al■l ■ l2005^ ■ 



3.2. Probability distribution of conditional fluctuations 

In lSvlos Labini et al.l (l200a ) we have introduced the scale- 
length (SL) analysis. This consists in determining the 
number N{r]Ri) of galaxies in spheres of radius r, cen- 
tered on the i*'* galaxjf^ whose distance from the origin is 
Ri'. that is. 



N{r;R,) 



n{s)ds , 



V{r;Ri) 



(4) 



where the integral is performed over the spherical volume 
V{r; Ri) of radius r centered on the i*'' galaxy at distance 



^ To avoid confusion a stationary stochastic point process 
with a positive ensemble average density is sometimes denoted 
as uniform and nonuniform when this is equal t o zero. Both are 
statistically homogeneous stochastic processes (|Gabrielli et al.l . 
.2005 ). 

^ See below for a precise definition. 
^ This is thus a conditional quantity. 
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Ri from the origin, and n{s) is the microscopic number 
density, which is a function of the space position s . 

When EqlH is averaged over the whole sample, it gives 
an estimate of the average conditional number of galaxies 
in spheres of radius r 



M(r) 



N{r) 



M{r) ^ 



(5) 



■th 



where the sum is extended to the M{r) galaxies, the 
at radial distance Ri from us, whose separation from the 
boundaries of the sample is less than or equal to r. In this 
way when r grows the number of values M{r) over which 
the mean in Eq|5] is performed decreases with r, because 
only those galaxies for which the sphere is fully included 
in the sample volume are considered as centers. 

This estimator , known as the full - shell estimato r 
(ISvlos Labini et al.l . [l99ilKerscheilll999HGabrieni et al 



20051) . has the advantage of making the weakest a-priori 



assumptions about the properties of the distribution out- 
side the sample volume. Indeed one may use incomplete 
spheres by counting the number of galaxies inside a por- 
tion of a sphere and weighting this for the cor responding 
volume of the spherical portion ( Kerscher . 19991 ). However, 
this method implicitly uses the assumption that what is 
inside the incomplete sphere is a statistically meaningful 
estimate of the distribution in the whole spherical vol- 
ume. This is incorrect when the distribution presents large 
fluctuations. For example in the part of a spherical vol- 
ume that lies outside the sample boundaries, there can 
be a void or a large-scale structure and in this sit uatio n 
the weighted estimation is biased (jCabrielli et all . l2005l ) . 
When the full-shell estimator is used, one should consider 
that there is an intrinsic selection effect related to the ge- 
ometry of the samples, which are small portions of spheres. 
When r is large only the more dist ant part of the sample 



is explored by the volum e average (jGabrielli et al. . l2005t 



ISvlos Labini et all . 12008 ). Indeed, for large-sphere radii, 
M(r) decreases and the location of the galaxies contribut- 
ing to the average in Eql5] is mostly placed at radial dis- 
tance in the range ^ [i?mMi + r, Rmax — r] from the radial 
boundaries of the sample at [Rmim Rmax]- Given the ge- 
ometry of the samples for large r, galaxies contributing 
to M(r) will also lie toward the center of the spherical 
portion. 

When Eql5] scales as 



N{r) 



(6) 



and D ~ 3, the distribution is homogeneous, while 
for £) < 3 it has long-range power-law correlations 
(jGabrielli et all . 12005 ). The scahng of EqU with L> < 3 
can be interpreted as the signature that the distribution 
is a fractal; however there are point distributions that, by 
construction, are not fractal objects but which may exhibit 
a sca ling of the type given by EqlS] (see IGabrielli et all . 
20051) . 



From EqlBjwe obtain that, in general, for D < i the 
conditional density scales as 



n{r) 



N{r) 



in this situation the average density is not a well defined 
quantity and the sample density is depends on the sample 
size; i.e., it does not give a meaningful estimation of the 
ensemble average density. 

When a distribution is fractal (or generally inhomoge- 
neous) on small scales and homogeneous on large scales, 
then we can ide ntify the homogeneity scale Aq to be the 
scale such that ( Gabrielli et al.l . 2005[ ) 



N{r) 



Vr > An 



(7) 



Depending on the details of the crossover from the 
strongly correlated regime at r < Aq to the weakly cor- 
related one at r > Ao different definitions of Aq can 
be adopted, but they all satisfy the condition given by 
Eql?) For instance one may define such a scale to be the 
one at which the rms fiuctuation have twice the value of 
the average density. If strong clustering occurs only for 
r < Ao < V^l^ it is required to make the statistical anal- 
ysis suitable to describe large fluctuations in this range 
of scales. Instead for Ao < r < V^^^ usual unconditional 
quantities are well-defined and their convergence to the 
ensemble values can be studied. 

The estimator defined by Eq|5| gives the first moment 
(i.e., the average number of points in spheres of radius r) 
of the PDF of conditional fiuctuations f{N] r) computed, 
at fixed r, from the values {-/V(r; i?i)}i=i^..^M(r)- This is 
generally different from the P DF of uncondi tional fluctu- 
ations — considered by, e.g., ISaslawl ( 2000l ) — both for 
homogeneous and inhomogeneous distributions, the dif- 
ference being more important in the former case. 

When a distribution becomes homogeneous; i.e., Eq|7| 
is satisfied, the PDF is expected to converge in a finit e 
volume to a Gaussian functior0 ( Gabrielli et al. . l2005l ): 



/(iV;r » Ao) 



1 



: exp 



27rS2(r) 



[N{r) ~ N{r)Y 



,(8) 



where Y?{r) is the estimation of the variance of the ran- 
dom variable N{r) (see below). 

The second moment of f{N, V) gives the conditional 
variance. For inhomogeneous distributions, this is such 
that 



5{r) 



Y?{r) N{rf-N{r) 



N{r) 



N{r) 



(9) 



where the last equality means that fluctu ations are 



persistent (jGabrielli and Svlos Labinil . l200l[ ). On the 



* This is clearly the case if the number of points is large 
enough, otherwise the PDF is described by the Poisson distri- 
bution 
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other hand, for homogeneous distributions with any 
kind of smaU-amphtude corre lations, we find that 
(iGabrieUi and Svlos Labinil . l200l[) 



6{r)^ « 1 . 



(10) 



When the sample density ns is well-defined; i.e., it does 
not depend on the sample volume, one may define the 
red uced two-point correla tion function that can be written 
as (iGabrielh et al.l . l2005h 



1 dNjr) 1 
Anr^ dr ns 



(11) 



In this case the homogeneity scale can be, for instance, 
defined as the scale beyo nd which £(r) < 1, o r the scale 
such that S'^{Xo) = 1 fsee iGabrielh et alLlioOsh . 



4. Results from the 2dFGRS 

In this section we present the results of the statistical anal- 
ysis of the VL samples of the 2dFGRS catalog discussed 
in Sect 121 We start by presenting the SL analysis to then 
move to the description of the determination of the PDF 
of conditional fluctuations. Furthermore, we consider its 
first moment; i.e., the average number of points in spheres. 
To illustrate the usefulness of the SL analysis, we consider 
its relation to the counts of galaxies as a function of radial 
distance and of apparent magnitude. This allows us to dis- 
cuss in detail the relation between small scale two-point 
correlations and large scale properties of fluctuations in 
the galaxy density field. We then consider the finite-size 
effects that systematically affect the determination of fluc- 
tuations amplitude normalized to the sample density. In 
addition we compute the average of the SL determina- 
tions {A^(?'; i?i)}i=i..M(r) ill bins of radial distance. The 
comparison of the behaviors in the NGC and SGC slices 
allows us to place a lower limit on the homogeneity scale. 

4.1. The scale-length analysis 

In FigsHllH the behavior of the SL analysis is shown in 
the four 2dFGRS samples we considered. One may note 
that in all cases there are large density fluctuations in 
the correspondence of the location of galaxy large-scale 
structures. In FiglSl a three-dimensional plot is shown of 
the same SL analysis reported in FigUl One may see how 
well structures are identified by this analysis. 

The number of points M(r) over which N(r;Ri) is 
computed, as a function of the sphere radius r is shown 
in Figl6l when the sphere radius r gets larger, the M (r) 
decreases quite rapidly. This is due to the geometrical se- 
lection effect previously discussed. In addition, the solid 
angle of the SGC slice is twice that of the NGC slice, and 
thus for r > 20 Mpc/h, there are more center points in 
the SGC samples than in the NGC ones. 




R (Mpc/h) 



Fig. 1. From top to bottom the SL analysis for the sample 
NGC400 with r = 5, 10 Mpc/h. 



C It* 




Fig. 2. The same as FigHlbut now for the sample NGC550 
with r = 10, 15 Mpc/h. 



Let us bricfiy discuss the main features that we detect 
in the various samples H 

— NGC400: there are large structures, which transversely 
cross the sample, at about 240 Mpc/h and at 260, 270, 
290 and 320 Mpc/h. All have approximately the same 
thickness of about 30-40 Mpc/h. When the sphere ra- 
dius is increased to r = 10 Mpc/h the most prominent 
structure remains the one at about 250 Mpc/h, which 
is not sampled anymore when r = 15 Mpc/h. This is 
due to the sphere centers being located toward the far- 
away boundaries of the sample, because of the geomet- 



^ Note that, for instance, I Eke et al] (|2004l '): lEinasto et al.l 

(l2006l ) used different methods to identify the same structures 
we observe. 



Sylos Labini, Vasilyev, Baryshev: Galaxy fluctuations from the 2dFGRS 



7 



500 




40 

X {Mpc/^ 



(Mpc/h) 



60 
50 
40 

30 
20 
10 



Fig. 5. The SL analysis for the SGC550 sample. On the X and Y axes the coordinate of the center of a sphere of 
radius r ~ 10 Mpc/h (centered on a galaxy) is reported and on the Z axis the number of galaxies inside it. The mean 
thickness of this slice is about 50 Mpc/h. Large fluctuations in the density field traced by the SL analysis are located 
in the correspondence of large-scale structures. 



It 




4l> - 




R (Mpc/h) 



Fig. 3. The same as FiglT]but now for the sample SGC400 Fig. 4. The same as FiglUbut now for the sample SGC550 
with r = 5, 10, 20 Mpc/h. with r = 10, 20, 30 Mpc/h. 
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□ □ SGC400 
♦ NGC550 
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io'l=- 





20 

r 



Fig. 6. Number of center-points M(r) as a function of the 
sphere radius r in the various samples considered. 



rical selection effect discussed previously. For r — \h 
Mpc/h, although only a few points (i.e M k. 3000) are 
effectively considered as sphere centers, density fluctu- 
ations are still large, determining variation of a factor 
four in the determination of the number of points in 
spheres at different radial distances. 

- NGC550: the structure at i? « 250 Mpc/h is clearly 
visible even in this sample, and other structures are 
present at larger radial distances R > 350. Mpc/h. 
Fluctuations are still large up to the largest sphere 
radius r = 25 Mpc/h where however the number of 
centers rapidly decreases for the geometrical selection 
effect discussed in Sect. 3 

- SGC400: for sphere radius r = f Mpc/h the situation 
is similar to the NGC400 case, except for the fact that 
the radial distances corresponding to the large varia- 
tions in the density field are different. This shows that 
large-scale structures and the corresponding large fluc- 
tuations detected by the SL analysis are quite typical 
of the galaxy distribution. There are two prominent 
large-scale structures at radial distance of the order of 
R - 320 Mpc/h. Finally for r = 20 Mpc/h the sample 
is dominated by one of the two structures just men- 
tioned, which corresponds to a variation of order five 
in N{r;R). 

- SGC550: a structure at i? « 320 Mpc/h with thick- 
ness of about 40 Mpc/h is present inside this sam- 
ple as well. In addition other structures are visible 
for R > 400 Mpc/h. For the largest sphere radius 
r = 30 Mpc/h, where only a few points are consid- 
ered as sphere centers, fluctuations in N{r; R) are of 
order five and they are due to structures located at 
radial distances R > 380 Mpc/h. 

In summary galaxy distribution in these samples is 
characterized by several large scale structures which cross 
their volumes. These structures are typical and persistent; 
i.e., they are detected at different radial distances and in 



the two different directions toward the SGC and NGC. 
They correspond in the SL analysis to large fluctuations 
in the density field: the thickness of the structures is par- 
tially an intrinsic property and partially due to the 'Finger 
of God' effect — the redshift space extension of large clus- 
ters and super-clusters — convolved with the sphere size 
used to measure the conditional density. Typ ical veloc- 
i ty dis persions are up to ~ lO'^km/s, although lEke et al 



(|2004[ ) conclude that the typical velocity dispersion for 
2dF groups and clusters is 260 km/s, which only corre- 
sponds to 2.6 Mpc/h in an additional radial extension. 
Thus while peculiar velocities can increase the size of ra- 
dial structures, their effect cannot explain the whole range 
of distances which characterize the observed structures. 

The maximum allowed sphere radius we considered, 
which, as discussed, is set by the geometry of the samples, 
is r = 30 Mpc/h. For this value of r we find fluctuations of 
order four in N{r; R) and this allows us to conclude that 
the homogeneity scale is certainly larger than 40 Mpc/h. 
In what follows we reinforce this conclusion by considering 
suitable statistical measurements. 



4.2. Probability distributions of conditional 
fluctuations 

We now turn to the discussion of the PDF of condi- 
tional fluctuations The behaviors in the various samples 
are shown in FigslTlfTOl We limit the discussion to the case 
where the number of determinations {-/V(r, i?^)}^^! ^(j.) 
at flxed sphere radius r is larger than few thousands 
points. For smaller numbers the measurement is affected 
by weak statistics and by flnite-size effects, thus not lead- 
ing to a statistically robust result. As discussed previously, 
when the sphere radius increases, there is a decrease in the 
number of centers, and thus a for large r, whose precise 
value is determined by the geometry of each specific sam- 
ple, the measurement is affected by large statistical and 
systematic effects (see Fig|6l). 

The first point to note is that in all cases the maximum 
of f{N,r) is statistically stable; i.e., it does not change 
when it is computed in the whole sample or in two non- 
overlapping sub samples with equal volume (each half of 
the sample volume) at small (sub-sample 5*1) and at large 
(sub-sample ^2) radial distance (see FigHand FiglS]). 

The tail for large values of N is instead affected by the 
different fluctuations which are present in different sub- 
volumes. The trend is obvious: the larger the fluctuations 
of N{r; R) the more extended toward high N values is 
the tail of f{N,r). In the deepest samples, e.g. SGC550, 
there is a single structure that dominates the distribution. 
However this is placed in the middle of the sample and for 
this reason, apparently, there is no a systematic difference 
in the PDF when this is computed in the two half volumes 
of the sample, one nearby and one far-away. However, it 
is clear that in this situation the shape of the PDF will 
be strongly affected by this fluctuation. Indeed, in each 
sub sample, for the largest sphere radius r we find that 




Fig. 7. Probability density function f{N,r) of the values 
{N{r; i?i)}i=i,..,M(r) for NGC400 in the whole sample and 
the best fit with a Gaussian function. The different sphere 
radius r is reported on the y-axis. The behavior in two 
non-overlapping sub samples (labeled as 5*1 and 5*2 is also 
reported — see text for details. Poisson error bars are 
shown as a reference. 

/(TV, r) is systematically distorted with respect to smaller 
sphere radii. This is because the volume average cannot 
explore the full sample properly because of the geometri- 
cal selection effect which, as discussed, is present in the 
determination of N{r; R). To properly determine the PDF 
on scales r > 20 Mpc/h larger samples are thus required. 

In all cases the PDF is systematically different from 
a Gaussian function, except for the case of NGC550 for 
which there are the weakest statistics, and it is charac- 
terized by a long TV tail which is directly related to the 
large scale structures present in these samples. In addition 
the PDF differs in different samples, especially for r > 10 
Mpc/h. This implies that, because of the weak statistics 
and small volumes, a clean determination of the PDF is 
impossible. For this reason we limit our discussion in what 
follows to the first moment of the PDF, leaving the deter- 
mination of the second moment to the other samples, for 
instan ce those of the SDSS, whe re spatial volumes will be 
larger (Sylos La bini et al.l . 12003 ) . 

In Fig[Tl] we show the collapse plot of the PDF in 
the various samples and for the different sphere radius 
considered. A rough fit of the large N tail is given by 

However, this result requires better samples before being 
confirmed. 



4.3. Conditional average number of galaxies in spheres 

In Fig[T2]we show the determination of the whole-sample, 
average conditional number of points in spheres; i.e., EqlS] 
Given that the PDF in all samples are statistically stable; 



Fig. 8. The same as in FigHfor the sample NGC550 with 
r = 10, 15 Mpc/h. For r = 15 Mpc/h there are only 3765 
determinations. 




Fig. 9. The same as in Figl7]for the sample SGC400 with 
r = 5,10,20 Mpc/h. 

i.e., they do not show important systematic differences in 
different sub- volumes, the full sample volume average pro- 
vides a meaningful statistical qua ntity. These be h aviors 
are the same as the ones found by IVasilvev et al] (|2006l ) 
for the average conditional density. We find that in all 
samples. 



N{r) = Br^ 



(12) 



with D = 2. 2 ±0.1 and B a constant. This result is statisti- 
cally robust up to r = 30 Mpc/h although for the SGC550 
sample the analysis can reach, with a weaker statistics, r ~ 
40 Mpc/h. In all cases, however, the determinations for 
scales r > 20 Mpc/h are done on a relatively small num- 
ber of points (see FigO, so they can be subject to system- 
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To briefly discuss the determination of the con 



Fig. 10. The same as in Figl7]for the sample SGC550 with 
r = 10,20 Mpc/h. 



~ 10" 




NGC400 r=5 
NGC400 1=10 
NGC400 1=5 
SGC400 r=10 
SGC400 r=20 
NGC550 1=10 
NGC550 1=15 
SGC400i=10 
SGC550 1=20 



A' 



Fig. 11. Collapse plot of the f{N,r) in the various sam- 
ples and for the different sphere radius considered. The 
normalization of the different samples has been performed 
in an arbitrary way and on the X-axis there are arbitrary 
units. 



atic fluctuations. The amplitude between the NGC400 and 
SGC400 samples (which have higher number density than 
that of the deeper samples) differs for about ~ 30% (see 
Fig fTS)) . This is due to the different fluctuations, which 
are present in the different volumes, and this implies that 
the samples are not large enough to precisely determine 
the amplitude. This difference is reflected in the number 
counts as a function of apparent magnitude as we discuss 
below. In the NGC550 and SGC550 samples, although the 
samples volume is larger, the constant B in Ea ll2l varies 
- 20% (see FigHS]). 



stant pre-factor i3 in EqfT2l (|jovce and Svlos Labinil . 12001 



Gabrielli et all . l2005l) and the normalization of the condi- 



tional average density in different VL samples one should 
consider the joint conditional probability of finding a 
galaxy of luminosity L at distance r from another galaxy; 
i.e., the (ensemble) conditional average number of galaxies 
{v{L,r))pdrdL with luminosity in the range [L, L -f dL] 
and in the volume clement dr at distance r from an ob- 
server located on a galaxy. We can make the greatly sim- 
plifying assumption that 

{v{y,L))p = ^{L)x{n{y))p, (13) 

where (n(y))p is the average conditional density and 4){L) 
the luminosity function, such that <f>{L)dL gives the prob- 
ability that a randomly chosen galaxy has luminosity in 
the range [L,L + dL]. Although there is clear evidence 
that there is a correlation between them and, for instance, 
that the brightest elliptical galaxies are found in the cen- 
ter of rich galaxy clusters, it has been tested that this is 
nevertheless a reasonable assumption in the galaxy cat- 
alogs available so far. To properly treat the case where 
the correlation between galaxy positions and luminosi- 
ties is taken into account, one would need to use the 
multi-fractal formalism. This goes beyond the s cope of the 



present manuscript (see iGabrielh et al.l . 120051 ). An addi 



tional, much stronger assumption often adopted in Ea ll3l 
is that the space density is a constant; i.e., {n{r)) = const. 
This assumption is for instance the bas is of the so-called 
stand ard minimum variance estimator ([Davis fc Huchra . 
19821 ). It is clear that we avoid making this further as- 



sumption because we want to test whether the space den- 
sity is (or can be approximated by) a simple constant. 

Using Eq |13l we may write the conditional average 
number of galaxies as a function of distance (in case 
D =const.) 



{N{r;Ll.L<L<LlL))p 



W{y,L))pdLdy 



(14) 



VL 

^1, 



' {(j)iL))pdL = Bvlt'' 



where N{r]Ly^ < L < Lyj^) is the number of galaxies 
in a sphere of radius r and with intrinsic luminosity in 
the range [Ly]^,LyjJ\, and Bvl is the amplitude of the 
number counts in the VL sample with these limits in ab- 
solute luminosity. Because the luminosity function has an 
exponential cut-off at L*, VL samples containing brighter 
galaxies show a smaller Bvl- By knowing the shape of the 
luminosity function, it is simple to normalize the differ- 
ent B y L in different VL samples (jjovce and Svlos Labinil . 
I2OOII) . 



4.4. Average in radial bins 

We can now use the data obtained by the SL method to 
investigate whether there is a convergence to homogeneity 
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Fig. 12. Average number of points in spheres of radius r 
around a galaxy. The difference amplitude in samples with 
different limits in absolute magnitude is simply ascribed 
by the effect of the luminosity function (see text). Error 
bars are estimated by the sample dispersion on the average 
value. 



NGP400 
SGP400 
NGP550 
SGP550 



r (Mpc/h) 

Fig. 13. The same as in FiglT2]but divided by the best-fit 
power-law behavior r^'^^. The variation in the amplitude 
B in Eq[T2]is clearer in this representation. The determi- 
nation for r > 20 Mpc/h is subject to systematic fluctua- 
tions, due to the limited volume and the weaker statistics. 



on some large scales r > 30 40 Mpc/h; i.e., the largest 
sphere radius allowed by the geometry of the samples. This 
is done as follows. We divide the whole range of radial 
distances in bins of thickness AR and compute the average 



1 



Nir;Rj) 



(15) 



where the sum is extended to the Alf, determinations of 
N{r;R), such that the radial distance is in the interval 
range [R, R + AR] . It is clear that for this measurement 



Fig. 14. Behavior of EqUHwith r = 5 Mpc/h for NGC400 
and SGC400 and AR = 50 Mpc/h. The error is computed 
through EqfTBl 



we have to use a small r to avoid overlapping in space 
between neighboring bins in radial distance. The variance 
on the mean is 

(N{r;R,)-N{r;R,AR)Y 

E{r;R,ARy= ^ ^ __ _ ^ .(i6) 

Rje[R,AR] 



MbiMb ~ 1) 



The error analysis in Eg fTBl assumes that the N{r; Rj) are 
independent, but in fact they are correlated. The errors on 
these points are therefore substantially under-estimated. 
However if there is a trend toward homogenization, the 
error caused by neglecting this correlation will be smaller 
hence the under estimate of the error bars. Only for the 
case of a highly correlated distribution does EqlH] under- 
estimate the error bars, which than represent a lower limit 
of the "true" error bars. 

The quantity given by Eq[T3]and its error fEgfTB]) pro- 
vide an estimation of the number of points in spheres of 
radius r averaged in thickness bin AR. We expect that, if 
the distribution converges to uniformity on a scale Aq , then 
correspondingly N{r; R, AR > Xq) does not show large 
fluctuations as a function of R. 

Results for the four samples are shown in FigsfTH- 
[T5] One may note that, for the largest radial bin chosen 
AR — 75 Mpc/h, there is no trend in homogenization, but 
instead the measurements in bins centered on different R 
wildly scatters; i.e., their values are outside the statisti- 
cal error bars given by EqdH This shows that large-scale 
structures have an amplitude that is incompatible with ho- 
mogeneity on scales smaller than Aq = AR = 75 Mpc/h. 

It is interesting to note that the large fluctuations be- 
tween the NGC and SGC samples cannot be generated 
by some redshift-dependent effect, such as the inclusion 
of galaxy evolution in the computation of absolute magni- 
tudes. Indeed, such a correction, which is expected to be 
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Fig. 15. As in FiglHbut for NGC550 and SGC550 with 
r = 10 Mpc/h and AR = 50 Mpc/h. 

small anyway, given that the redshifts involved do not ex- 
ceed 0.2, would affect both samples in the same way. Thus 
by comparing the estimation of the density in bins in the 
same range of radial distances, we can conclude that the 
fluctuations we have detected are intrinsic to the distri- 
bution of galaxies in these samples. A similar argument 
can be made for the effect of different cosmologies in the 
computation of the metric distance. 

4.5. Radial counts in VL samples 

A complementary way to study fluctuations on large scales 
in galaxy redshift surveys is represented by the deter- 
mination of the radial counts; i.e., the counts of galax- 
ies as a function of the radial d istance in VL samples 
(jGabrielli and Svlos Labinil . 120011) . In order to have a sta- 
tistical estimator and to evaluate fluctuations, we divided 
the angular area of the samples into Nf — 20 non overlap- 
ping sub fields of equal solid angle. For each we compute 
the differential radial density Ui {R; AR) in bins of thick- 
ness AR ~ 10 Mpc/h , where i*'* labels the sub field and 
R is, as usual, the radial distance. We then can compute 
the average 



1 



Nf 



n(i?;Ai?) = ^^n,(i?;Ai?) 

/ ,-—1 



(17) 



and the sample variance 



aliR; AR) 



Nf 



Nf 



1 ^ 

i=l 



ni{R;AR) - n(i?; Ai?))^ . (18) 



Results are shown in Figs fTBlflTl In the NGC400 sample 
the structure at 250 Mpc/h is visible as a relatively large 
fluctuation of n{R; AR) and a correspondingly large error. 
This means that this structure partially covers the angu- 
lar area of the survey. In the NGC550 sample, the radial 
density is flatter, although there is a large dispersion. In 




NGC40U I - 




200 250 300 

R (Mpc/h) 



Fig. 16. The average differential radial density n{R; AR) 
(EqdTl) and its error (Eg llSp for the samples NGC400 and 
SGC400. 
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Fig. 17. The same as in FigHHfor SGC550 and NGC550. 

the SGC400 sample, the structures at 160 Mpc /h and 320 
Mpc/h are identified as local enhancements of n{R; AR). 
The same occurs for the SGC550 case, where the same 
two structures are visible. By comparing Figs ll6IIT7l with 
FigsHHHone may note that the SL analysis is a much more 
powerful method than the simple counting as a function 
of radial distance in tracing large-scale galaxy structures. 

4.6. Redshift distribution in the magnitude limit 
sample 

By studying the redshift distribution in the 
Durham/UKST Galaxy Redshift Survey, fluctua- 
tions have been found in the observed radial density 
functio n are close to 50% occurring on ~ 50 M pc/h 
scales (iRatcUffe et all . Il998t iBusswell etaP . l2004[ ). In a 
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Fig. 18. Upper panel: Radial density in bins of thickness 
10 Mpc/h in the NGC magnitude limited sample. The 
most prominent features identified by the iV(r; R) anal- 
ysis are also visible by the simple counting. There is a 
large structure at ^ 240 Mpc/h. By comparing this figure 
with Figs[l]|4]one may appreciate the usefulness of the SL 
method in tracing structures Bottom panel: The same for 
the SGC magnitude limited sample. Here there are large 
structures at ^ 180 Mpc/h and - 320 Mpc/h. 



similar way in the 2dFGRS (IBusswell et al.l . 120041 ) . two 
clear "holes" in the galaxy distribution were detected in 
the ranges 0.03 < z < 0.055, with an under-density of 
~ 40%, and 0.06 < z < 0.1 where the density deficiency is 
~ 25%. These two under-densities, detected in particular 
in the the 2dFGRS southern galactic cap (SGC), are 
also clear features in the Durham/UKST survey. Given 
that the 2dFGRS SGC field is entirely contained within 
the areas of sky observed for the Durham/UKST survey, 
the similarities in the redshift distributions are both 
proofs of the same f eatures in the galaxy distribution 



( Busswell et al.U2004 ) 



We can now compare the redshift distribution in the 
magnitude limit sample with the results obtained by the 
SL analysis. In Fig[TH]we report the counting of galaxies 
as a function of the radial distance, in bins of thickness 
10 Mpc/h, in the whole magnitude limited samples. It is 
interesting to compare these behaviors with Figs[T][31 The 
SL method clearly identifies the same structures, which 
are visible in FiglTS] as peaks of the radial distribution. 
However the SL method is able to quantify the ampli- 
tude of these fluctuations and, by applying the statistical 
analysis presented above, to determine how typical these 
structures are. 

4.7. Magnitude counts 

In FiglT9] we report the differential counts of galaxies as 
a function of apparent magnitude in the S GC and NGC 
The behaviors are similar to those found bv lNorberg et al 



Fig. 19. Differential counts of galaxies, in bins of Am = 
0.25, as a function of apparent magnitude in the SGC and 
NGC. A reference line corresponding to N(m) ^ iQ^m 
with a = 0.5 is reported. 



(|2002ai ): iBusswell et al.l (|2004l) : the former paper con- 
cluded that there is conclusive evidence that counts in the 
SGC are down by 30% relative to the NGC counts. We 
find the same difference and, as discussed, we can directly 
relate it to the large-scale structures present in both sam- 
ples. Indeed, as already discussed, the amplitude of the 
conditional number of galaxies (i.e., EqfT2|) is ^ 20-^30% 
higher for the NGC samples than for the SGC ones. In 
other words, in the NGC samples there are more struc- 
tures, hence fluctuations in the N{r; R), than in SGC sam- 
ples, as can be seen by comparing, for instance, FigsHJH 

4.8. The two-point correlation function 

The standard way to measure two-point correla- 
tions is accomplished by determi ning the function 

gi ven bv EalTTl — s ee e.g ., 
19691): iDavis and Peebie^ (llQSsI); 



Benoist et al 



Norberg et al 




(Il99f 
(I2OOIL 



Totsuii and Kihara 



.Zehavi et al.l 
2n02b[ ) 



Park et al.l 
(12001 



1994 ): 



20041 ): 



As already mentioned 
when measuring this quantity it is implicitly assumed 
that the distribution is homogeneous well inside the 
sample volume; i.e., Aq ^ V^^"^. Let us see what happens 
when, inside a spherical sample of radius Rs there is 
a fractal distribution with dimension D < 3. We may 
estimate the sample density ns (which is not an average 
quantity) by 



ns 



N 
V 



3B 

An ' 



R 



D-d, 



(19) 



where, in the second equality on the r.h.s., we used EqfT2l 
for the number of points in spheres. Equation [19] shows 
that the sample density depends on the sample size when 
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D < 3. The estima tor of the two-point co rrelation function 



can be written as (iGabrielh et al.l . l2005h 



+ 1 = 



iV(r, Ar) 1 
y(r, Ar) ns 



(20) 



The first ratio in the r.h.s. of Eql^Ol is the average con- 
ditional density; i.e., the number of galaxies in shells of 
thickness Ar averaged over the whole sample, divided by 
the volume V{r, Ar) of the shell. The second ratio in the 
r.h.s. of Eq[2Q]is the sample density. By using the second 
equality on the r.h.s. of Eq[Tn] for ns and Eq[T2] for the 
conditional number of points in spheres, we find 



D-3 



1 



(21) 



which shows that the amplitude of ^(r) depends on Rg for 
D < 3] i.e., as long as the conditional density is a power 
law as a function of scale . 

Equation [21] has been obtained by making the assump- 
tion that the estimation of the sample density is given by 
the second equality in the r.h.s. of Eq[T9l This is gener- 
allyot the the sample density in inhomogeneous 

distributions is subjected to fluctuations of order one on 
the scale of the sample size. Thus the behavior given by 
Eq l21l should be interpreted as giving a very rough esti- 
mation of the amplitude of £,{r) in a finite sample when 
there is a fractal distribution inside it. 

It is worth noticing that, if the conditional density is 
a power-law function of scale, then ^(r) is not a power- 
law over the same of scales of the conditional density, and 
particularly it does not have the same power law index 
. Indeed, as shown by EqUIl S,{r) has a a break of the 
power law and it is possible to compute analytically the 
exponent of ^(r) as a function of the expo nent of the con- 



dition al density and the scale ratio r/Rs (iGabrielli et al 



It is easy to show that on scales r/Rs <^ 1 the cor- 
relation exponent measured by the two-point correlation 
analysis coincides with what is measured by the condi- 
tional density, while on larger scales the exponent mea- 
sured by the ^(r) analysis is general ly smaller than D — 3 . 
This is indeed the result obtained bv lHawkins et al. ( 2003[ ) 
(see their Fig. 6). 

Let us now evaluate the sample density simply as 
ns = N/V in SGC400 and NGC400. Given that the sam- 
ple geometry is a sphere portion, the volume is given by 



V2dFGRS - {Rmax ^ ^1 



(22) 



By using the parameters of the VL samples (see Table [T]) 
we obtain respectively in the SGC400 sample 



n 



sgc 



7.0 • 10" 



Pair-counting based estimators, like the Davis & Peebles 
and/or the Landy & Szalay estimator, have the same patholo- 
gies whe n the conditional den sity is a power law as a function 
of scale (|Gabrielli et al.1 . |2005| ). 



■ NGC400 
> SGC400 



r(Mpc/h) 

Fig. 20. Standard two-point correlation function in the 
SGC400 and NGC400 sample estimated by EqM The 
sample density is simply computed as N/V. 

galaxies per (Mpc/h)'^ and in the NGC400 sample 



ri;i^= 1.0-10-' . 

Thus there is a 30% difference in the amplitude. When 
we normalize the conditional density in the NGC400 and 
SGC400, which as discussed have the same ~ 30% differ- 
ence in amplitude, to the respective values of the sample 
density we get the amplitude of ^(r) is nearly the same 
(see Fig |20| . This is because both the nominator and de- 
nominator in the r.h.s. of Eg 1201 vary in the same way, hav- 
ing a difference of about 30% in both samples. Thus by 
measuring the amplitude of fluctuations normalized to the 
sample density the information about the large variations 
in the two different slices is lost. This does not, however, 
imply that fluctuations are small beyond ro ~ 6 Mpc/h 
as one would conclude by only considering the two-point 
correlation function. Actually, as we discussed in the pre- 
vious sections, fluctuations are large and persistent over 
the whole volume of these samples. It is indeed the nor- 
malization of fluctuations amplitude to a not well-defined 
quantity, the sample density, that in this case gives rise 
to the determination of the length scale rg « 6 Mpc/h, 
which is however a spurious quantity. The volume of the 
SGC400 sample is larger than that of the NGC 400 s ample: 
this is reflected in the fact that the break of ^ (r) occurs 
at a longer distance than for the NGC400 sample. For the 
samples NGC550 and SGC550, there is a similar situation, 
but the variation of both the sample density and of the 
amplitude of the conditional density is of about 20%. 

Different estimators of the two-point correla- 
tion function, such as the Davis and Peebles (DP) 
(|Davis and Peebles!. Il983l) estima t or and the Landy and 
Szalay (LS) ("Landy and SzalaV, ^ 1993 ) estimator (see 
discussion in Sylos Labini and Vasil yev 12008) for more 
details about the different estimators), lead to an estima- 
tion of the amplitude of ^(r), which agrees with the one 
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Fig. 21. Standard two-point correlation function in the 
SGC400 measured by means of different estimators, 
namely the full-shell (FS), the Davis and Peebles (DP) 
and the Landy and Szalay (LS). 

just discussed above. For instance in Fig |21l a comparison 
is shown of the different estimators. It is worth noticing 
that the different estimators give the sample amplitude 
of £,{r), but they differ in the scale at which ^(r) has 
the break in the power-law behavior. This is explained 
by the different ways the estimators treat the boundary 
conditions, and, particularly, include (implicitly) the 
global condi tion known as integral constra int — see 
discussion in lSvlos Labini and Vasilve 

The sample density ns can be estimated differently 
from EqfT9l The only condition that it is required for 
Eql2Q]to be a valid estimator is that the size r* of the vol- 
ume entering in the denominator of Eq |19l be larger than 
the homogeneity scale Xq. In fact, when r* Aq the esti- 
mation of the sample density does not differ substantially 
from its ensemble average value because, in this situation, 
the amplitude of the two-point correlation is, by definition, 
much smaller than unity. Thus we may consider another 
estimator of the two- point correlation fu nction, which is 
the one introduced bv lSvlos Labini et al. |2008.1 



e(r; i?, Ai?) + 1 



iV(r,Ar) V{R,AR) 
Vir,Ar) N{R,AR) 



(23) 



where the second ratio on the r.h.s. is the density of points 
in a shell of thickness r* = AR around the radial distance 
R. If the distribution is homogeneous; i.e., r* > Aq, and 
statistically stationary, Eql23] should be statistically inde- 
pendent of the range of radial distances {R, AR) consid- 
ered. In Fig[221 we show the determination of Eq[221 for 
different values of R, AR in the two deepest samples. The 
amplitude of ^ (r) systematically depends on the choice of 
the normalization and this is just the imprint of the large 
scale inhomogeneities present in these samples. 

In summary, even though the fact that there is a rel- 
atively large difference in the densities between the NGC 



Fig. 22. Standard two-point correlation function in the 
SGC550 and NGC550 samples estimated by EqEl The 
sample average density is computed in spheres of radius r* 
and considering all center points lying in a bin of thickness 
AR centered at different radial distances R. 

and the SGC, the amplitude of the correlation function is 
similar because it is measured with respect to a varying 
density; i.e., its value reflects the assumption of homogene- 
ity which is used in the definition in the ^-analysis. Only 
by analyzing fluctuations that are not normalized to the 
sample density one can detect the effect of the large spatial 
inhomogeneities characterizing these galaxy samples. 

4.9. Analysis in the catalog with conservative cuts in 
apparent magnitude 

We now briefly discuss the results of the SL analysis for the 
VL samples obtained with conservative magnitude cuts 
(see Table[2]). In Figl23]wc show the results of the SL anal- 
ysis: by comparing this figure with FigslT][4] by a simple 
visual inspection one note that structures are extremely 
similar. Clearly, because of the fewer points contained in 
the conservative-cuts samples, the value of N{r; R) is dif- 
ferent. The behavior of the PDF for the various samples is 
shown in Figl23l The results are thus statistically stable. 

5. Comparison with mock-catalogs 

Standard theories of galaxy formation assume that fluctu- 
ations in the matter de nsity field in the early universe h ave 
very small amplitude, (jPeeblesl . IPeacockL Il999l ). In 

this situation, by denoting as P{k,t), the PS of matter 
at an arbitrary time t ^ 0, where t = t* corresponds to 
some early times in the universe as we discuss below, the 
prediction of standard cosmological models can be written 
in this form 



P{k,t) = AP{k,t*)g(k,t) . 



(24) 



It is possible to write P{k,t*) = Phz{k)T'^{k) where 
Phz{k) is the Harrison-Zcldovich (HZ) PS, and T{k) is 



16 



Sylos Labini, Vasilyev, Baryshev: Galaxy fluctuations from the 2dFGRS 



S50 
_^ 200 

a: 

i.' 150 
100 





250 
200 
ISO 
100 



-| ■ NGC4QQr^j(l 



50 



- | ■ SGC400I-JO 





100 150 200 250 300 350 
R(MpcJll) 




200 L' 



3m 4(Xi 

R(Mpc/h) 



Fig. 23. SL analysis for the samples with conservative ap- 
parent magnitude cuts. The value of the sphere radius is 
reported in the captions 



the relative mass variance shows the fastest possible decay 
as a function of scale ^'^^{r) oc 1/r'^. The PS must have a 
cut-off at some high values oi k; i.e, on small scales. This is 
provided by the transfer function T{k), which can be de- 
termined given the supposed properties of (dark) matter. 
For CDM this is such that at small k the PS maintains 
the HZ tail, w hile at large k it approximately decays as 
(|Peacocla ,1999). Correspondingly, the two-point cor- 
relation function is positive for r < Vc and negative for 
r > Tr. with a power- law tail of the type ^(r) ^ — r~'* 



(|Svlos Labini and Vasilvevl l2008h 



Gravitational clustering in the linear regime, in an 
Einstein de Sitter cosmology, is characterized by a grow- 
ing a nd a decaying mode, both of them power laws in 
time (jPeeblesl . [l980l) g{k,t) = g{t) oc t^/^. The ampli- 
tude A in Eql2ll is determined from the observations of 
CMBR anisotropics and from the theoretical assumptions 
on the nature of cosmological dark matter. From the time 
dependence of P(fc, t), it is possible to derive the time de- 
pendence of A™(t), defined to be fT^(A™) = 1. This grows 
as a power-law function of time as well: particularly for 
power-law PS: i.e., P{k) ^ fc" and n < 4, one obtains 



(IPeeblesl . Hoi 




(25) 
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Fig. 24. PDF of the samples with conservative appar- 
ent magnitude cuts. The value of the sphere radius is re- 
ported in the labels. In the comparison we used the nor- 
malized variable by using the transformation described 
by Eas l26ll27l 



a transfer function that depends on the type of cou- 
pling between matter and radiation in the early uni- 
verse (jPeacockl Il999t ). The HZ PS is predicted by in- 
flationary theories to be the outcome of the exponential 
amplification of quantum fluctuations occurring in the 
early phases after the Big-Bang, and this is such that 
Phz{k) oc fc. A PS behaving as P{k) oc A;" on small k 
with n > 1 belongin g to the class of super - homo geneous, 
or hyper -unif orm ( Torquato and Stillingeii . distri- 
butions (|Gabrielh et al.l . l2002h . Their main characteristic 
is that density fluctuations are of surface type; i.e., that 



To summarize the situation, we may identify two length- 
scales at the present time. The first is the homogeneity 
(or nonlinearity) scale A™: for r < A™ nonlinear cluster- 
ing took place and thus it changes the shape of PS in a 
non-linear way. On the other hand for r > A™, fluctua- 
tions are still in the regime today, and linear perturba- 
tion theory predicts a simple linear amplification of pri- 
mordial correlations. From the normalization of the initial 
am plitude A to the CMB R anisotropies, AJf ' « 10 Mpc/h 
(Springel et al.l . [20051 : ISpergel et all . 120071 ). In addition. 



from the properties of the PS of CMBR anisotropies it is 
derived that the second intrinsi c length scale of these mod- 
els is about rc ~ 100 Mpc/h (jSvlos Labini and Vasilvev . 



20081) 



In order to study gravitational structure formation in 
the nonlinear phase, the common practice is to perform 
N-body simulations of theoretical models. This is done by 
integrating the equation of motions of N self-gravitating 
particles, in a volume V and by making use of periodic 
boundary conditions to represent an infinite (periodic) 
system. Initial particle correlations are given according to 
a given theoretical model, and the initial redshift is gen- 
erally z > 10. The simulation is then run up to z = 0. 
In addition the space background is expanding and thus 
one follows the particles' motion in comoving coordinates. 
Particles are supposed to simulate the motion of fluid el- 
ements of the underlying dark matter fleld. 

To identify galaxies one uses a phenomenological ap- 
proach. As discussed, galaxies are supposed to form in 
the highest density peaks of the dark matter fleld. Thus 
when the simulation has reached the redshift z = 0, one 
uses semi-analytic models to identify galaxies. Among the 
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larg est simulations m ade publicy available, the millennium 
run (jSpringel et al. I. [2005 1 used more than 10 billion par- 
ticles to trace the evolution of the matter distribution in a 
region of the universe in a cubic box of 500 Mpc/h. Semi- 
analytic catalogs constructed fro m the millennium r un 
contains about 10 million objects (jCroton et al l l2006l) . 

Here we analyze the semi-analytic catalog containing 
9,925,229 objects in which the absolute magnitudes of 
mock-galaxies are given in the BVRIK filters. To repro- 
duce the same limits in absolute magnitude of the volume- 
limited samples of the 2dFGRS, we used the relations be- 
tween magnitude in different filters given bv lColless et al 



(|200lh . In this way we selected respectively 1,119,434 and 
368,619 galaxies in a 500 Mpc/h cube. We then selected 
three slices with the same geometry of the real 2dFGRS 
samples, which are hereafter called SGC400m, NGC400m 
and NGC550m. The remaining sample, SGC550m, is 60° 
wide instead of 84o as in real data. The number of ob- 
jects in each of the four samples is close to the one in the 
corresponding real 2dFGRS sample. 

In Fig[25] we show the behavior of the SL analysis for 
the mock-sample SGC550m in real and redshift space for 
r = 20 Mpc/h (see for comparison Figsl2][4]). The effect of 
peculiar velocities is that of enhancing a little the struc- 
tures that appear in real space. That the difference be- 
tween real and redshift space is small for sphere radius 
r > 10 Mpc/h is shown in Figs E51l?71 where we plot the 
PDF of conditional fluctuations. In this case we used the 
normalized variable 



N,ir)-N{r) 



and we thus determine its PDF, that is. 



(26) 



(27) 



where f{N,r) is the PDF of the v ariable Ni{r), N{r) its 
estimated first moment, and E(r) its estimated standard 
deviation on the scale r. For sphere radius smaller than the 
homogeneity scale predicted by theoretical models; i.e., 
Aq' = 10 Mpc/h, the PDF has a relatively large tail for 
high values of N, although around the peak it is well fitted 
by Gaussian function. For r > 10 Mpc/h, the PDF rapidly 
converges to a Gaussian and already for r = 20 Mpc/h the 
fit with a Gaussian function is extremely good also at high 
N. Correspondingly the plot of the SL analysis does not 
show the presence of large amplitude fiuctuations. 

In FigsHBlUn] we show the comparison between the 
PDF in the mock-sample SGC500m (redshift space) and 
the one in the real sample SGC550. Both for the sphere 
radius r = 10 Mpc/h and r = 20 Mpc/h, fiuctuations are 
more persistent in real samples than in mock-samples and 
this is refiected in the relatively fat tail of the PDF for 
large N values. 

In FiglSO] (upper panel) it is shown the behavior of the 
whole-sample average number of points is spheres; i.e., 
EqU For r > Ag" = 10 Mpc/h, the function N{r) growths 
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Fig. 25. SL analysis with sphere radius r = 20 Mpc/, 
for the mock-sample SGC550m (the samples with better 
statistics) in real space (upper panel) and in redshift space 
(bottom panel). 
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Fig. 26. PDF for the mock-samples SGC400m and 
NGC400m in real space (RS) and redshift space (ZS) for 
r = 10 Mpc/h. The best fit with a Gaussian function is 
reported as reference. 

with an exponent equal to the space dimension (i.e., D = 
3). On smaller scales the exponent is instead D — 1.2±0.1. 

Finally in the Figl30] (bottom panel) the behavior of 
of Eq[T5]with r = 5 Mpc/h is shown for the mock-galaxy 
samples in real space. Fluctuations, when averaged over 
radial bin of thickness AR = 50 Mpc/h are more smooth 
and quiet than the real samples and thus that there are 
no large fiuctuations on scales larger than A™. This is 
clearly consistent with the behavior of the PDF described 
previously. The small difference for small R is due to the 
lower statistics at the nearby boundary of the samples. 

Because the distribution becomes uniform at scales r > 
Aq ~ 10 Mpc/h, the two-point correlation function is well- 




Fig. 27. The same as in FigM but for the case r = ^'S' The same as in FiglM]but for the case r - 20 

20 Mpc/h and for the mock-samples SGC550m and Mpc/h and for the sample SGC550. 

NGC500m. 
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Fig. 28. Comparison between the PDF in the mock- 
sample SGC400m (redshift space) and the one in the real 
sample SGC400. Poisson error bars are displayed as a ref- 
erence. 



defined, both in real and redshift space. This presents an 
intrinsic length scale ^(ro) = 1 at about tq « 6 Mpc/h, 
and beyond that scale it rapidly decays to zero. 

The whole-sample average number of points in spheres 
(i.e., EqlSI) does show a systematic difference with respect 
to the real-space case. Indeed, while for r > A™ = 10 
Mpc/h it again shows a power-law behavior with Z? = 3, 
on smaller scales the exponent is D = 2.2±0.1 instead of 
D = 1.2 as for the real space case (see FiglST] — upper 
panel). In addition, the transition between the two regimes 
is faster than for the real space case. 

Finally in Fig [21] (bottom panel) the behavior of EgfTSl 
is reported for the mock-samples in redshift space. As for 
the real-space case, and consistent with the behaviors of 
the PDF in redshift space, fluctuations are substantially 
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Fig. 30. Upper panel: Whole-sample average number of 
points in spheres; i.e., Eq[5]for the mock-samples in real 
space. Bottom panel: Behavior of Eq |15l for the mock- 
galaxy samples in real space for SGC550m; the sphere 
radius is r = 10 Mpc/h and the radial bin thickness is 
Ai? = 75 Mpc/h. The error is computed through Eg [TBI 



smoothed when averaged over radial bins of size Ai? > 



6. Discussion and conclusions 

We characterized fluctuations and correlations in the 
galaxy distri bution, by considerin g several samples from 



the 2dFGRS (jColless et al.U200lh . The main point of our 



analysis is to use statistical quantities that measure fluctu- 
ations amplitude not normalizing it to the sample average. 
Our main results are: 
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Fig. 31. Upper panel: Whole-sample average number of 
points in spheres; i.e., EqlSl for the mock-samples in red- 
shift space. Bottom panel: Behavior of Eq[T5]for the mock- 
samples in redshift space for SGC550m; the sphere radius 
is r = 10 Mpc/h and the radial bin thickness is Ai? = 75 
Mpc/h. The error is computed through Eg [TBI 



Estimations of the sample average are affected by sys- 
tematic variations that are the imprint of the struc- 
tures with the large spatial extension present in these 
samples, which correspond to large amplitude fluc- 
tuations. To identify structures in redshift space we 
applied the scale-length (SL) method, introduced in 



Svlos Labini et all (|2008l ). 



The probability density function (PDF) of conditional 
fluctuations is, in all considered samples and up to 
r = 20 Mpc/h (the largest distance scale we considered 
for this statistics), wildly different from a Gaussian 
function and characterized by a long tail, possibly with 
a power-law decay, signaling the presence of large fluc- 
tuations (e.g. structures) in the density field. These 
fluctuations are persistent up to scales limited by sam- 
ple sizes. 

The first moment of the PDF shows scaling properties, 
that correspond to a fractal dimension D — 2.2 ±0.1 
up to r « 40 Mpc/h. At the largest distance scale we 
considered; i.e., for r > 20 Mpc/h, determinations are 
affected by relatively large fluctuations that reflect the 
intrinsic systematic error induced by structures, and 
thus the results have weaker statistical significance for 
the limited volume and number of points contributing 
to the average. This ag rees with our previous findings 



mm . 

The homogeneity scale, i.e. the scale beyond which 
fluctuations become small and the sample density es- 
timation approaches the value of the (ensemble) aver- 
age density, has a lower limit at Aq = 70 Mpc/h. This 
is because fluctuations on scales of about 70 Mpc/h 
are incompatible with a transition to homogeneity on 



th at scale. This r e sult i s consistent with the r esults 
bv iBussweU et al.l (|2004[) : iRatcliffe et all (|l998[) . The 
main results obtained in this survey are in very good 
agreement with a similar analysis of galaxy samp les 



extracted from the SDSS (|Svlos Labini et all . l2008f ). 



— The SL method is suitable and effective in tracing 
structures in redshift space and in measuring the 
amplitude and distribution of density fluctuations. 
Galaxy counts results can be easily related to the re- 
sults obtained by the SL method. Indeed we confirmed 
that the variation of about ^ 30% between galaxy 
counts as a function of apparent magnitude in the 
northern and sout hern angu l ar reg ions of this survey, 
detected by iBusswell et al.l (|2004l ). can be easily re- 
lated to the different structures present in the differ- 
ent samples. Moreover, such large fluctuations, make 
any estimation of the sample average, on comparable 
scales to the samples sizes, affected by strong system- 
atic effects. 

— Apparently the amplitude the two-point correlation 
function, which normalizes the amplitude of fluctua- 
tions (as a function of scale) to the estimation of the 
sample average, of is not affected by fluctuations, and 
that this is stable in the Northern and Southern sam- 
ples. This occurs, however, because of the normaliza- 
tion to the sample average (i.e., the denominator of 
^(r)) which indeed shows the same large variation be- 
tween the two angular regions as the conditional den- 
sity (i.e., the nominator of £,{r)). Thus the normal- 
ization of fluctuations amplitude to the sample aver- 
age hides the presence of large differences between the 
two samples, being biased by uncontrolled systematic 
finite-size effects. In this situation the amplitude of 
the two-point correlation function does not give a re- 
liable and meaningful estimation of either the statisti- 
cal properties of galaxy distribution in these samples, 
the spatial extension of correlations, or of amplitude 
of fiuctuations. It is indeed the normalization of fluc- 
tuations amplitude to a not well deflned quantity, the 
sample density, which in this case determines the spu- 
rious length scale rp « 6 Mpc/h. 

— We compared the results by studying conditional fluc- 
tuations in the 2dFGRS samples with those obtained 
in mock-galaxy catalogs. These are constructed from 
the dark matter density field in cosmological simula- 
tions which were evolved to study the effect of non- 
linear gravitational clustering in an expanding uni- 
verse. We used the mock-galaxy samples provided by 
Croton et al. I (Sooi) and obtained from the largest N- 



body simulations pub licly available at the moment 



( Springel et al. . 20051 ). We have studied the mock 



galaxy catalogs both in real and redshift space, so to 
take into account the distortions due to the effect of pe- 
culiar velocities. We found that the PDF of conditional 
fluctuations is weakly affected by peculiar velocities 
and that it rapidly converges to a Gaussian function 
on scales r > 10 Mpc/h, in agreement with predictions 
of theoretical models both in real and redshift space. 
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On smaller scales, i.e. r < 10 Mpc/h, the PDF presents 
a longer tail than the Gaussian function. This is the 
signature of the nonlinear structures formed at small 
scales. Indeed, the PDF of mock-galaxy catalogs re- 
flect the dynamics that has given rise to structures in 
the simulations. In this respect, correlations provided 
by theoretical models and given as initial conditions 
for the simulations are of CDM type. In these models 
gravitational clutering forms nonlinear structures (i.e., 
large fluctuations in the density field) up to the limited 
range of about 10 Mpc/h. Such a length scale can be 
easily predicted by considering the initial amplitude 
of fluctuations in these models, normalized to CMBR 
anisotropics, and by computing the evolution in an ex- 
panding universe of t he linear regime of gr avitational 
clustering. Note that lEinasto et al. (200a) reached a 
similar conclusion; i.e., that in the 2dFGRS there are 
several rich super-clusters which is higher than what is 
found in similar volumes in the Millennium simulation. 



Our conclusion is therefore that nonlinear structures 
(i.e., large fluctuations) predicted by models extend to 
scales that are much smaller than the ones we found 
in the 2dFGRS sample and that amplitude of fluctua- 
tions on large scales, i.e., r > 10 Mpc/h, is large in 
the real samples and small in the mock-galaxy catalogs. 
Thus standard models of galaxy formation, with CMBR 
normalization, are unable to form large-scale structures 
such as the ones present in the observed galaxy distribu- 
tion. In addition, standard models, beyond the value of 
the homogeneity scale, predict a second intrinsic length 
scale represented by the distance scale at which the 
two-p oint correlation function changes sign ([Durrer et al 



2002 



f omt correlation lunction cnanges t 
Svlos Labini and Vasilvevl l2008l) 



i.e., marking the 
transition from small amplitude fluctuations with weak 
positive correlations to the regime where there are anti- 
correlations (and fluctuations of extremely small ampli- 
tude). This length scale is about Tc « 100 Mpc/h in cur- 
rent models; i.e., slightly larger than the lower limit to the 
homogeneity scale Xq = 70 Mpc/h we can place by study- 
ing the 2dFGRS samples. Forthcoming galaxy samples will 
allow us to determine whether there will be a transition to 
super-homogeneity (negative correlations) and on which 
scale this eventually occurs. 
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